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INTRODUCTION 


Social  psychological  research  problems  sometimes  require  collection 
of  data  on  intragroup  communication.  The  investigator  must  know  who 
talked  to  whom  and  how  frequently.  Two  factors  which  have  the  potential 
for  complicating  the  collection  of  such  data  are  group  size  and  type  of 
communication  pattern.  Chappie,  Bales  and  many  others  have  collected  such 
data,  but  usually  in  groups  either  so  small  or  so  orderly  that  one  observer 
could  monitor  the  entire  group  simultaneously. 

If  subjects  are  to  be  identified  individually,  then  size  is  an  important 
constraint.  The  observer  must  quickly  be  able  to  identify  every  subject 
individually.  The  larger  the  group,  the  more  difficult  it  is  to  do  this. 

No  doubt  there  is  some  upper  limit  on  size  (so  far  as  I know,  unknown)  for 
ready  identification  by  the  average  observer.  Size  also  effects  geographical 
area.  Larger  groups  require  a larger  area  and  constrain  visual  recognition 
across  distances. 

Even  more  important  than  size  is  communication  pattern.  For  instance, 
a very  large  lecture  class  can  be  observed  so  long  as  the  lecturer  is  the 
focus  of  communication.  The  lecturer's  communicative  acts  and  those 
addressed  to  him  by  auditors  can  be  coded  without  great  difficulty.  When 
such  a pattern  breaks  down  and  spontaneous,  simultaneous  dialogue  begins 
within  many  dyads  most  category  systems  interpret  this  as  a manifestation 
of  disorder  and  provide  a residual  category  to  describe  it  such  as  "silence 
or  confusion"  (Flanders,  1970,  34). 


-2- 


There  are  many  natural  groups  in  which  spontaneous,  simultaneous 
communication  within  many  dyads  is  not  only  normal  but  a manifestation 
of  order,  not  disorder.  In  a large  office  many  separate  conversations 
occur  simultaneously.  This  is  also  true  of  a party.  Whatever  the 
group  or  situation,  multiple  dyadic  interaction  requires  the  observer 
to  adopt  a different  stragegy  of  data  collection  than  is  used  in  small 
groups  or  groups  with  a centralized  communication  pattern. 

THE  PROBLEM  IN  THE  NAVY  STUDY 

The  focus  of  the  Navy  study  was  the  influence  of  proximity  and 
similarity  on  informal  group  formation.  As  in  the  Sherif  studies  I 
wished  to  study  the  emergence  of  informal  groups  in  a natural  setting. 

It  was  necessary  to  record  the  frequency  of  interaction  of  individuals 
as  these  smaller,  informal  groups  differentiated  within  a larger  one,  and 
be  able  to  document  the  emergence  of  patterns.  One  way  to  do  this  was 
to  observe  subjects  in  training  units  varying  in  size  from  30  to  60  men 
interacting  spontaneously  and  simultaneously  in  the  compartment  (barracks 
wing)  in  which  they  lived. 

Figure  1 is  a sketch  of  a typical  compartment  of  such  a unit.  The 
men  were  assigned  to  double  racks  (bunks)  (for  typical  examples  see 
Figures  2-7).  Each  compartment  held  a maximum  of  80  men  in  40  racks. 

In  addition  to  the  main  compartment  itself,  there  was  a small  office 
and  entrance  hallway  to  the  fore.  Aft,  the  compartment  connected  to  a 
lounge  where  men  could  view  TV  or  use  pop  machines.  At  the  fore  of  the 
compartment,  on  the  starboard  side,  were  doors  leading  to  the  head, 
showers  and  drying  room.  Each  unit  spent  free  time  from  1530  until  taps 


Office 
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Figure  1:  Typical  compartment 

for  Navy  apprenticeship 
training  unit  by  subject  number. 
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Figure  4:  A perspective  down  the  center  of  the  compartment. 

Notice  the  tables  where  men  may  read,  write,  play  cards  (contrary 
to  regulations)  or  sew. 


Figure  5:  A side  view  of  the  compartment  from  the  perspective 

of  an  upper  rack.  This  is  a typical  view  of  observers  when 
collecting  interaction  data. 


Figure  6:  The  passageway  along  the  starboard  bulkhead.  Mo 
observations  were  taken  from  the  vantage  of  this  relatively 
inconspicuous  and  little-used  location. 


Figure  7 : The  smoking  and  television  lounge  attached  to  the 

compartment.  Notice  the  television  set  and  Coke  machine,  the 
foci  of  activities.  Many  units  spent  well  over  a quarter  of 
their  free  time  watching  television. 
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(2130)  within  this  area.  Interaction  patterns  developed  within  a new 
training  unit  during  their  free  time.  Interaction  was  usually  spontaneous 
and  simultaneous  among  several  different  groups  of  trainees. 

Because  of  their  number,  physical  location  and  the  decentral ized 
communication  pattern  it  was  not  possible  to  observe  all  the  men 
simultaneously.  It  was  necessary  to  develop  a sampling  design  based  on 
a knowledge  of  the  size  of  sample  necessary  for  analysis  and  inference 
as  estimated  by  Larntz  (see  the  other  paper  included  with  this  report). 

For  reasons  stated  below  it  seemed  important  to  collect  data  on  interpersonal 
communication  within  other  areas  of  the  training  battalion  including  the 
galley,  classroom  and  classroom  lounge  (see  Figures  8-9)  where  interaction 
was  also  simultaneous  and  spontaneous.  Here  too,  it  was  necessary  to 
develop  a sampling  design. 

SAMPLING  THE  COMPARTMENT 

In  developing  the  design,  consideration  was  given  to  several  al ternatives . 
In  a previous  study  six  subjects  in  a group  of  80  had  been  sampled  on  an 
ordered,  rotation  basis.  Using  a list  of- the  six,  the  observer  had  located 
the  first  subject  on  the  list  and  recorded  his  communicative  behavior  at  the 
time  located,  then  the  second,  and  so  on.  When  the  sixth  subject  was  coded 
the  observer  started  again  at  the  beginning  of  the  list.  All  six  were  coded 
within  a time  limit.  This  method  entailed  successive  search  within  a 
geographical  area  containing  several  rooms,  one  of  which  was  about  160  feet 
long  and  contained  many  people.  Finding  six  subjects  was  very  time  consuming. 
Successively  locating  an  entire  unit  would  have  been  even  more  time 
consuming  and  would  have  limited  the  total  number  of  samples  to  a small 
number.  Any  method  involving  search  for  individuals  was  impractical. 


Figure  8:  The  galley  where  apprentices  ate  their  meals.  They 

seated  themselves  at  vacancies  at  these  tables  after  going 
through  a cafeteria  line.  The  unit  marched  to  the  galley  as 
a group. 


Figure  9:  One  of  the  classrooms.  Each  unit  was  in  training 

for  ten  curriculum  days. 
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The  alternative  we  choose  was  a form  of  area  sampling.  The  compartment 
and  lounge  were  divided  into  ten  sampling  areas  (see  Figure  10).  Each  area 
was  delimited,  based  on  pretesting,  considering  as  a criteria  wrat  size 
geographical  area  could  normally  be  quickly  scanned  by  an  observer  to 
determine:  a)  who  was  in  the  area;  b)  if  someone  was  in  the  area,  whether 
interaction  was  occuring;  and  c)  if  interaction  was  occuring,  then  with 
whom? 

It  was  not  practical,  to  divide  the  area  into  units  of  equal  size. 

Certain  large  areas  were  seldom  used,  while  some  small  areas  were  intensively 
used.  In  Figure  10  the  numbered  sampling  areas  are  displayed.  Dividing 
lines  between  areas  went  through  the  center  of  racks.  This  may  at  first 
seem  surprising,  but  is  due  to  the  fact  that  pretests  showed  that  subjects 
lying  flat  on  their  backs  in  the  middle  of  racks  seldom  interacted.  They 
were  more  apt  to  sit  up  on  one  side  of  the  rack  or  the  other  when  talking 
to  another  subject. 

One  study  unit  occupied  about  half  the  racks  in  the  compartment. 

All  the  racks  in  Area  Seven  were  unoccupied.  While  subjects  occasionally 
used  this  area,  activity  was  not  great  considering  its  size.  Similarly 
Area  One  was  larger,  including  the  entrance  passageway  and  a small  office. 
Much  interaction  took  place  in  the  equally-sized  Area  Two  through  Six. 

By  walking  down  the  starboard  isle  next  to  the  bulkhead  the  observer 
could  inspect  one  sampling  area  at  a glance.  He  then  coded  the  area: 
a)  empty  of  subjects;  or  b)  if  occupied  he  identified  the  occupants  and 
v/hether  or  not  they  were  interacting;  and  c)  if  they  were  interacting  he 
coded  the  interaction  partner(s). 
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The  hallway  inco  the  lounge  was  considered  a sampling  area  (Area  8). 

The  lounge  was  divided  into  two  areas.  The  chairs  were  lined  up  in  several 
rows  facing  the  television,  which  was  mounted  in  a corner.  A more  or  less 
diagonal  line  divided  rows  of  chairs  closest  to  the  television  from  rows 
in  the  back.  One  sampling  area  in  the  lounge  included  that  part  of  the 
area  close  to  the  television;  the  other  that  part  which  was  more  distant. 
While  these  two  areas  were  often  crowded  the  attention  of  the  subjects 
was  usually  focused  on  the  television.  This  focus  effectively  constrained 
interaction  so  that  the  observer  could  readily  code  for  each  area. 

After  the  areas  were  delimited  random  lists  (without  replacement)  of 
the  ten  areas  were  generated.  After  pretesting  it  was  found  that  it  was 
possible  to  sample  all  ten  areas  in  15  minutes.  The  observers  were 
instructed  to  sample  the  ten  areas  in  random  order  for  15  minutes.  They 
would  then  sample  the  ten  areas  the  next  15  minutes  using  a different 
random  order.  Sampling  was  carried  out  each  weekday  from  1530  to  2130  on 
the  15  minute  schedule  with  a half  hour  break  for  chow.  It  was  possible 
to  get  from  18  to  22  samples  per  day  using  this  schedule. 

It  was  necessary  to  enunciate  several'  conventions  for  observers  to 
follow  in  utilizing  this  design.  The  first  convention  pertained  to  those 
lying  in  the  middle  of  their  racks  and  therefore  in  the  middle  of  the 
imaginary  line  dividing  sampling  areas.  Such  a person  was  always  considered 
to  be  in  the  area  aft  of  the  dividing  line.  Thus,  a person  lying  in  the 
middle  of  the  first  rack  was  considered  to  be  in  area  two. 

The  second  convention  pertained  to  interarea  interaction.  For  obvious 
reasons  it  was  necessary  to  avoid  counting  a subject  twice,  if  for  instance, 
two  subjects  were  conversing  across  the  boundary  of  an  area.  Since  both 
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speaker  and  listener  were  coded  it  would  have  been  possible  to  code  both 
present  in  each  area,  thus  compromising  the  independence  of  each  count. 

To  avoid  this,  when  interarea  interaction  was  occuring  it  was  coded  o^ly 
if  the  other  subject  was  to  the  aft  of  the  area  being  coded.  Thus,  if 
subject  A in  Area  Two  talked  to  subject  B in  Area  Three,  when  Area  Two  was 
sampled  the  interaction  would  be  toward  the  aft  and  therefore  coded.  When 
Area  Three  was  sampled,  since  the  same  dyad  would  be  interacting  toward 
the  fore  it  was  not  counted.  Counting  the  same  interaction  twice  was  thus 
avoided  with  this  procedure. 

Finally,  by  convention  (or  definition)  we  utilized  a somewhat  broader 
definition  of  "interaction"  than  many  have  used.  Often  "interaction"  is 
limited  to  focused  verbal  behavior.  There  are  several  other  essentially 
symbolic  activities  which  are  focused,  but  which  may  not  involve  continuous 
overt  verbal  behavior.  In  the  first  place  interacting  includes  not  just 
talking,  but  active  listening.  Since  the  observer  scanned  an  area,  taking 
a fix  on  its  occupants  and  whether  or  not  they  were  interacting,  he  did  not 
listen  to  a sequence  of  turn  taking  in  conversation.  The  observation  of 
each  area  was  somewhat  like  a snapshot,  recording  interaction  as  of  that 
moment.  Therefore,  listening  was  considered  as  important  as  speaking.  The 
first  code  indicated  the  speaker.  The  subsequent  code  specified  the  active 
listener(s). 

Secondly,  mutually  focused  symbolic  activities  in  which  two  or  more 
engaged  such  as  playing  cards  or  craps  were  considered  to  be  interaction. 
Purely  audience-type  activities  were  not  considered  interaction.  To  qualify 
as  interaction  there  had  to  be  mutual  participation  in  some  common  focus  of 
interest  which  involved  active  participation  by  the  subjects.  Since,  in  this 
context,  this  unit  was  not  a work  group,  this  enabled  us  to  code  quite 
important  forms  of  leisure  interaction  not  limited  to  overt  verbal  behavior. 
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In  Figure  11  is  a printout  from  one  observation  period.  Figure  12 
is  the  matrix  of  counts  from  a full  observation.  The  cells  on  the  diagonal 
indicate  the  number  of  times  the  subject  was  coded  as  physically  present 
in  the  compartment  in  some  sampling  area  but  not  interacting.  The  off 
diagonal  cells  indicate  the  interaction  counts  with  other  subjects.  Only 
the  upper  right  triangular  matrix  is  necessary  because  speaking  is  counted 
equivalent  to  listening.  The  count  is  symmetric.  Marginals  may  be  unequal 
for  several  reasons,  not  only  due  to  normal  sample  variation,  or  observer 
error  but  due  to  subject  absence  from  the  compartment.  For  obvious  reasons 
of  privacy  the  head,  shower  and  drying  room  areas  were  off  limits  to  the 
observers.  Subjects  might  also  be  on  liberty  during  at  least  part  of  the 
period  of  observation  (see  "Liberty  Sections"  below)  or  occasionally, 
somewhere  else  in  the  barracks. 

It  was  possible  for  the  observer  to  code  a subject  twice  during  a 
15  minute  sampling  period  if  the  subject  physically  moved  from  one  area 
to  another  during  that  time.  The  second  coding  indicated  a different 
conversation  from  that  coded  in  the  first  area,  and  therefore  was  at  least 
partially  independent  of  the  first. 

CLASSROOM  SAMPLING 

In  every  day  life  as  one  sequentially  occupies  one  position  after  another  - 
family  member,  student,  employee,  customer  - one  moves  from  one  set  of  proxemic 
relations  to  another.  Most  studies  have  taken  only  one  set  of  such  relations 
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Figure  12: 

A Matrix  of  Counts  from  a Full  Observation 
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into  account.  For  instance,  Festinger,  Schachter  and  Back  took  into  account 
only  the  proxemic  relations  in  the  housing  project.  They  did  not  take 
into  account  the  relations  the  respondents'  husbands  had  as  students  and 
employees  during  the  many  hours  a day  they  were  not  at  home.  During 
apprenticeship  training  only  two  such  sets  of  relations  v/ere  fixed  by 
assignment  and  occupied  for  considerable  periods  of  time  each  day: 
compartment  rack  order  and  classroom  desk  order. 

In  designing  the  study,  I considered  whether  these  orderings  should  be 
different  or  as  alike  as  possible.  The  effect  of  very  different  orderings 
was  interesting.  There  is  little  attention  in  the  literature  to  sequential, 
multiple  orderings.  This  would  have  constituted  an  additional  experimental 
condition  impossible  to  accomodate  within  the  overall  design  since  it  would 
have  resulted  in  too  many  groups.  I decided  that  while  it  was  impossible 
to  maintain  classroom  ordering  equivalent  to  that  in  the  compartment  we 
should  seek  to  maintain  it  as  closely  as  possible.  Since  informal  observation 
indicated  covert  interaction  occured  mostly  within  rather  than  between  rows 
the  ordering  was  maintained  within  rows.  A typical  ordering  is  displayed  in 
Figure  13.  Rows  were  numbered  from  left  to  right  (facing  the  front  of  the 
classroom).  Subject  1 was  assigned  to  the  front  seat  in  Row  1,  subject  2 to 
the  seat  behind,  and  so  on.  The  next  row  v/as  filled  from  the  back  so  that 
the  next  numbered  subject  after  the  last  in  Row  1 was  directly  across  the 
isle  in  Row  2. 

Because  our  primary  focus  was  on  informal,  spontaneous  interaction  it 
was  necessary  to  observe  apprentices  very  carefully  in  the  classroom.  For 
the  most  part  their  training  consisted  of  six  to  eight  hours  of  lectures 
per  day.  They  were  forbidden  to  talk  to  each  other  during  lectures. 
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Figure  13: 

A Typical  Classroom  Desk  Ordering 
by  Subject  Number 
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Communication  was  covert  and  fleeting,  often  whispered  exchanges  during 
lectures  or  tests.  Since  communication  was  covert  it  was  difficult  to 
code  if  the  entire  class  was  monitored  at  one  time.  Again,  we  decided 
to  sample,  this  time  by  rows.  The  observer  was  given  a randomly  ordered 
list  of  rows  (wi thout  replacement) . He  would  observe  the  first  row  in  the 
ordering  for  five  minutes,  then  the  next,  and  so  on.  When  all  the  rows 
had  been  observed  he  would  start  again  using  another  ordering.  The  observer 
sat  at  the  back  of  the  classroom  to  make  these  observations. 

Students  were  ordered  to  stand  in  the  back  of  the  classroom  if  they 
thought  they  might  go  to  sleep  in  class.  It  was  not  unusual  at  any 
particular  time  for  several  students  to  be  standing.  If  a student(s) 
from  the  row  being  observed  was  standing  in  the  back  then  he  was  included 
in  that  sample. 

THE  CLASSROOM  LOUNGE 

Periodically,  but  at  irregular  intervals,  a class  was  given  a smoke 
break.  Apprentices  went  to  a large  room  in  which  there  were  four  high 
narrow  benches  on  which  ashtrays  were  located.  Rules  about  throwing  ashes 
or  cigarettes  on  the- floor  were  very  strict  so  they  clustered  around  these 
benches  to  talk  and  smoke.  There  were  no  chairs.  During  short  breaks 
observers  coded  these  clusters  distinguishing  separate  conversation  groups 
within  them  when  such  differentiation  was  possible.  They  did  not  code 
interaction  but  rather  the  apparent  physical  clustering  of  the  subjects 
into  interacting  groups.  Since  this  clustering  was  by  choice,  I thought 
it  might  give  some  clue  to  informal  relations. 
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GALLEY  SAMPLING 

The  base  was  served  by  a very  large  galley.  Since  nearly  two  thousand 
sailors  had  to  be  fed  at  each  meal,  scheduling  was  rigid.  Each  unit  was 
scheduled  for  a particular  time.  The  unit  marched  to  the  galley,  went 
through  a cafeteria  line,  and  then  individuals  seated  themselves  at  separate 
four  to  six  man  tables  of  their  own  choice.  Their  freedom  to  choose  was 
constrained  only  by  the  number  of  tables  already  occupied. 

Because  I thought  the  sharing  of  meals  might  also  provide  a clue  to 
the  development  of  informal  relations  observers  accompanied  the  unit  to 
the  galley.  After  it  had  passed  through  the  cafeteria  line  and  individuals 
had  chosen  their  tables,  the  observer  circulated  through  the  galley  coding 
the  subjects  by  table.  He  made  no  attempt  to  count  interactions  but  only 
to  specify  table  groups. 

LIBERTY  SECTIONS 

Each  unit  was  divided  into  two  liberty  sections.  On  alternate  days 
each  section  was  on  liberty  from  1830  until  2130.  The  entire  unit  was 
together  in  the  compartment  from  1530  to  1830  and  then  only  one  section 
was  in  the  compartment  from  1830  to  2130,  except  for  those  who  were  eligible 
for  liberty  but  chose  to  remain  in  the  compartment.  Some  evenings,  for 
disciplinary  reasons,  liberty  was  cancelled.  While  it  was  impossible  to 
observe  the  group  on  liberty,  it  was  possible  to  know  which  subjects  they 
were  and,  to  some  extent,  which  subjects  went  together.  Every  subject  who 
went  on  liberty  was  noted  on  a liberty  list  by  the  company  clerk  and  his 
exact  time  of  return  was  logged.  This  liberty  list  was  copied  at  the  end  of 
each  day.  A fairly  strong  assumption  was  made  that  if  subjects  A,  B and  C 
all  returned  at  exactly  the  same  time  that  they  had  been  on  liberty  together. 
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Interaction  was  not  frequent  in  the  classroom.  From  classroom  lounge, 
galley  and  liberty  list  only  grouping  could  be  inferred.  For  none  of  these 
were  frequencies  large.  Therefore,  these  data  have  only  been  used  to 
supplement  the  interaction  data  gathered  in  the  compartment. 

MECHANICS  OF  DATA  COLLECTION 

This  design  proved  practical  for  the  Navy  project.  Subjects  were 
contacted  when  they  reported  for  training.  The  project  was  explained  to 
them  and  their  informed  consent  obtained.  If  they  gave  their  consent 
to  be  part  of  the  study,  we  sought  their  permission  to  take  a Polaroid 
snapshot.  The  photo  was  labelled  with  the  subject's  project  identification 
number  and  name.  This  served  as  a learning  tool  for  observers  who  also 
spent  from  one  to  three  days  with  the  subjects  before  the  unit  was  officially 
formed  and  began  training.  In  this  way  the  30  to  50  subjects  were  individually 
known  to  the  observers  by  number  and  name  before  the  formal  data  collection 
began. 

During  the  approximately  14  days  each  unit  was  observed,  data  were 
recorded  by  both  machine  and  pencil  and  paper  techniques.  A prototype, 
solid  state  Datamyte'II  was  used  during  most  of  the  study  (see  Figures  14-15). 
This  device,  which  contains  an  automatic  clock,  has  a keyboard  which  permits 
entry  of  numeric  or  alphabetic  codes.  These  are  stored,  together  with  the 
times,  in  the  device.  At  the  end  of  the  observation  period  the  memory  was 
emptied  directly  onto  magnetic  tape  on  a computer  400  miles  away  through 
an  Execuport  portable  teletype  terminal.  The  data  were  cleaned  on  line. 

The  paper  and  pencil  data  were  a backup  in  case  of  failure  of  the  Datamyte  II. 
Figure  11  is  an  example,  not  only  of  the  observer  code,  but  of  Datamyte  II 
output. 


Figure  15:  This  is  the  normal  way  to  hold  the  Datamyte 
while  in  use.  It  is  lightweight  and  easily  grasped  with 
one  hand. 
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THE  CODE 

To  record  their  data  observers  used  an  alphanumeric  observational 
language  which  was  recorded  onto  the  memory  of  the  Datamyte  II  through  its 
keyboard.  Various  combinations  of  letters  and  numbers  were  the  vocabulary 
of  the  observational  language.  Each  "word"  in  the  language  was  constrained 
by  certain  ordering  rules,  somewhat  like  a grammar,  as  well  as  by  other 
conventions. 


Word 


Meaning 


Expl anation 


HHH, 


mmddyy, 

tttt. 


2.  CFH, 


tttt, 

3.  on*sn*ln. 


4.  Cnn, 
Fnn, 


Beginning  of  observer 
work  day. 


Date. 
T ime. 


Beginning  of 
observation  cycle. 


Time 

Observer  number, 
sample  number, 
location  number. 


Sample  area  number 
within  sample  cycle. 

Subject(s)  physically 
present  - no 
interaction. 


The  observations  were  made 
over  several  hours  each  day. 
Beginning  and  end  of  day 
codes  marked  the  boundaries 
of  the  entire  day's  data. 

Month,  day  and  year  the  data 
were  collected. 

Datamyte  II  contains  a clock 
counting  in  hundredths  of 
minutes.  To  relate  this  to 
a regular  24-hour  clock  as 
used  in  the  military  it  was 
necessary  to  enter  the  time 
of  day  by  hand. 

In  each  sampling  location 
(compartment,  classroom, 
classroom  lounge,  galley)  each 
complete  cycle  of  samples  had 
to  be  preceded  by  this 
indicator. 

Time  cycle  began  on  24-hour 
clock. 

Every  observer  was  given  a number 
Each  sample  cycle  was  assigned  a 
unique  number  (l...n)  from  the 
start  of  each  unit.  Compartment, 
classroom,  lounge  and  galley 
were  also  assigned  numbers. 

For  area  numbers  within  the 
compartment  see  Figure  10. 

Every  subject  physically  present 
but  not  interacting  in  the 
sampling  area  wa s code'  by  number 
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Word 

Meaning 

Explanation 

Hnn,  nn, 

Subjects  interacting. 

One  group  of  subjects  interacting 
follows  any  H. 

Hnn.nn, 

Another  group 
i nteracti ng 

Often  more  than  one  group  may 
be  located  in  the  same  area. 
Every  numbered  subject  within 
the  group  follows  the  H. 

Cnn, 

etc. 

Sample  area  number. 

The  next  area  in  the  randomly 
ordered  list  of  areas  in  the 
cycle. 

5.  CCC, 

End  of  observation 
cycle. 

After  the  ten  areas  are  sampled 
a cycle  is  complete. 

tttt, 

T ime. 

Time  cycle  ended  on  24-hour 
clock. 

6.  HFC, 

Liberty  list. 

After  sailors  return  from  liberty 
they  sign  in  with  the  times  of 
their  return  recorded  beside 
their  names.  For  clues  to 
leisure  companions  this  list 
was  recorded  by  subject  number 
and  time. 

nn*tttt. 

Subject  number  and 
time  returned  from 
1 iberty . 

HFC,  HFC, 

End  of  liberty  list. 

> 

End  of  observer' s 
work  day. 

This  must  appear  at  the  finish 
of  each  day's  data  collection. 

mmddyy. 

Date. 

tttt. 

Time. 

Specific  Indicators 

In 

Compartment  - 21;  Galley  - 22; 
classroom  - 23;  classroom  lounge  - 24 

F100 

Sample  area  empty. 

85 

Non-unit  apprentice  (repeat  as  needed) 

86 

Navy  person  (not  apprentice);  i.e. , unit 
counselor,  instructor,  etc. 

99 

Group  as-a-whole. 
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Appendix  3.1 


DETERMINING  SAMPLE  SIZE  NECESSARY 
TO  DETECT  GROUPS  FROM 
INTERACTION  FREQUENCIES 


by 

Kinley  Larntz,  Ph.D. 
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INTRODUCTION 

One  objective  of  a study  in  which  frequencies  of  dyad  interactions 
are  collected  is  to  identify  groups  (i.e.,  cliques)  that  exist  in 
the  study  sample.  A group  is  defined  here  as  a collection  of  individuals 
that  interact  at  a rate  higher  than  that  usually  expected  for  a random 
collection  of  individuals.  Of  course,  the  ability  to  identify  groups 
will  depend  heavily  upon  how  far  above  the  norm  the  group 
interaction  rate  turns  out  to  be,  e.g.,  a group  that  interacts  at 
ten  times  the  normal  rate  is  much  easier  to  detect  than  one  that 
interacts  at  ten  percent  above  the  normal  rate. 

In  designing  an  experiment  one  of  the  principal  questions  a researcher 
must  address  is  whether  the  study  has  a chance  of  accomplishing  its 
r objectives.  If  the  identification  of  groups  is  an  objective,  then  • 

it  is  important  to  determine  what  constitutes  an  adequate  sample  size 
(i.e.,  how  many  observations  are  needed)  for  such  identification.  This 
report  addresses  the  question  of  required  sample  size  when  groups 
interact  at  a rate  of  k (k>l)  times  the  normal  rate. 

NOTATION  AND  ASSUMPTIONS 

The  study  sample  consists  of  N individuals  all  of  whom  are  free  to 
interact  with  the  others.  The  random  pairing  interaction  model  assumes 
that  pairs  of  interacting  individuals  are  formed  at  random  but  allows 
for  differing  propensities  to  interact  (Tl^,  H^,***,  I ^,)  • For  the 
pair  (i,j)  the  relative  frequency  of  interaction  under  the  random 

Hi  nj 

r nt  ii j 

i<j 


pairing  model  is 
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If  n interactions  are  observed  the  expected  number  of  (i,  j) 
pairs  is  simply 


For  details  on  estimation  of  the  m under  the  random  pairing  model, 
see  Larntz  and  Weisberg  (1976). 


The  data  available  for  analysis  will  usually  consist  of  the  frequencies 
of  pair  (i,  j)  interaction  which  we  will  label  X.j.  Thus  ^ ^ X.  . = n 

is  the  total  number  of  observations.  Since  observation  usually  occurs 
for  a fixed  length  of  time  (rather  than  until  n counts  are  recorded),  a 
reasonable  model  for  the  counts  is  that  they  are  independent  random  variables 

following  a Poisson  distribution,  X.^  Poisson  (m..).  The  objective 

of  identifying  groups  is  translated  into  identifying  pairs  whose  Poisson 
rate  is  higher  than  the  rate  under  random  pairing,  namely  m^. 


A normal  approximation  for  the  distribution  of  X^  is  possible  when  m^ 


is  large.  Small  sample  improvements  have  been  proposed  by  many  individuals, 
but  one  that  has  proven  particularly  useful  is  due  to  Freeman  and  Tukey  (1950), 


d , . = lx. . + |X. . + 1 

ij  \J  ij  ij 


J 


4m^j  + 1 


N (0,1)  . 

For  the  calculation  of  sample  sizes  this  is  replaced  by  the  asymptotically 

I 


equivalent 


J 


ij 


ij 


N (0,1)  . 


However,  in  practice  the  group  determinations  are  based  on  the 


Freeman-Tukey  approximation. 
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Calculation  of  Required  Sample  Size 

N 

If  there  are  N individuals  in  the  study  sample,  there  will  be  (^)  possible 

pairs,  and  thus  when  .05  level  tests  are  conducted  on  each  pair  the 
overall  error  rate  becomes  much  greater  than  .05.  To  adjust  for  this 
problem,  a common  procedure  calls  for  making  the  individual  tests  at 


a reduced  significance  level. 


a = ir- 

V 


The  overall  error  is  then  bounded  above  by  a.  Thus,  "significant"  pairs 
will  be  identified  by  Freeman-Tukey  deviates  in  excess  of  ya,-. , where 

«,  x is  the  one-tailed  a*  critical  value  for  the  normal  distribution. 

Values  of  k for  N = 20,  40,  75  are  given  in  the  table  below: 


Adjusted  Normal  Critical  Values 


To  calculate  the  required  sample  size  for  a given  situation,  the  alternative 
hypothesis  and  the  power  requirement  must  be  specified.  To  illustrate  the 
calculations  we  will  use  N = 40,  a = .05,  required  power  of  .90,  and  an  alternative 
rate  that  is  k = 3 times  the  random  pairing  rate.  The  form  that  the  sample  size 
requirement  takes  is  calculation  of  m_.  Thus,  under  the  null  hypothesis 
(random  pairing)  will  have  rate  nf  while  under  the  alternative  the 
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Tables  of  required  m are  given  for  N = 20,  40,  75;  « = .10,  .05,  .01; 

k = 2,  3,  4,  5,  7,  10;  and  required  power  .50,  .75,  .90,  .95,  .99.  The 
value  computed  above  appears  as  the  table  value  for  N = 40,  a = .05, 
k = 3,  power  = .90. 
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All  of  the  computations  are  based  on  the  normal  approximation  which 


is  better  for  larger  values  of  g km^  than  for  small  values.  Thus, 
for  very  small  values  of  such  as  less  than  5,  the  power 

requirement  may  not  be  attained  exactly. 
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Power 


N = 75 

Size  of 
Test 

.50 

.75 

.90 

.95 

.99 

.10 

23.0 

31.5 

40.2 

45.9 

57.8 

k = 2 

.05 

24.9 

33.7 

42.6 

48.6 

60.8 

.01 

29.4 

38.9 

48.5 

54.8 

67.8 

.50 

.75 

.90 

.95 

.99 

.10 

7.4 

10.1 

12.9 

14.7 

18.5 

k = 3 

.05 

8.0 

10.8 

13.7 

15.6 

19.5 

.01 

9.4 

12.5 

15.5 

17.6 

21.7 
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NOTE:  See  Note  on  Pape  7. 
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